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REGULARITY FOR A LOG-CONCAVE TO LOG-CONCAVE 
MASS TRANSFER PROBLEM WITH NEAR EUCLIDEAN COST 

MICAH WARREN 



Abstract. If the cost function is not too far from the Euclidean cost, then 
the optimal map transporting Gaussians restricted to a ball will be regular. 
Similarly, given any cost function which is smooth in a neighborhood of two 
points on a manifold, there are small neighborhoods near each such that a 
Gaussian restricted to one is transported smoothly to a Gaussian on the other. 

< 

^ . 1. Introduction 



h^ , This note deals with the regularity of the optimal transportation map, when 

f~^ ' the distributions under consideration are close to restricted Gaussians. From the 

On . work of Ma, Trudinger and Wang, ([MTW], [TW]) regularity holds for arbitrary 

smooth distributions on nice domains when the cost satisfies the MTW A3s con- 
dition. It is established by Loeper [L] that without this MTW condition on the 
cost function, one cannot expect regularity for arbitrary smooth distributions, and 
(^ [ the question of regularity is wide open. Here we show that we can find smooth 

optimal transportation, at least for some very nice distributions. 

We show two results. The first is that when the transportation problem involves 

K^ I distributions somewhat like the standard Gaussian restricted to the unit ball, then 

; I ■ if the cost function is close enough to the Euclidean distance squared cost, the map 

5t 1 must be regular. As a corollary, given two points and any cost which is smooth 

near these points, we can find very focused Gaussians, restricted to very small balls 

near the points, so that the optimal transport is regular. 

Our method yields a way to compute precisely how close the cost function need 
be to Euclidean, or relatedly, how small the balls must be around the given points. 
Recently other perturbatitive results for regularity of optimal transport have ap- 
peared: Delanoe and Ge [DG] show regularity for certain densities on metrics near 
constant curvature. Caffarelli, Gonzalez and Nguyen [CGN] present estimates, 
when the cost is Euclidean distance raised to powers other than 2. 
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2 MICAH WARREN 

Specifically, let /, / be functions on regions 57, J7 C M", satisfying on fl 

(al) \Df\ < 1 

(a2) 1<5 < D'\f < 2 

(a3) \D'f\ < 1 

and similarly for / on f2. 

Wc define the following mass distributions 

(1.1) m = e--^(")xo 

(1.2) m = e-/»xa 

where we may add a constant to / so that both distributions have the same total 
mass. 

The region 17 will be required to have a defining function h so that on 17 = 
{h < 0} , h satisfies the same three conditions (al-3) as /, as well as, along the 
boundary dU, 

(1.3) \Dh\ >l/2, 

which implies the second fundamental form of the set dVt = {/i = 0} is bounded by 
4. Similarly define an /i, f2. 

A solution of the optimal transportation equation for these densities and a given 
cost function c(a;, x) is a function u{x) which satisfies 

(1.4) detwy =e-^(^)e-''(^(^'^"»|detc,,(a;,T(x,i:'u))| 

(1.5) T{x,Du){VL)^n 

where 

(1.6) Wij = Uij{x) - Cy (x, T{x, Du)) = CsiT^ 
and T{x, Du) = (T^, T^, . . . , T") C 17 is determined by 

Ui{x) = Ci{x,T{x,Du)). 

(Such a solution must also be c-convex. In our setting, the two notions of convexity 
are very close, so we won't belabour this point here, see Lemma 12.61 ) We will use 
the following convention: The derivatives of the cost function in the first variable 
X will be I, j, k etc. The second variable x will be denoted by indices p, s, i, etc. 
Also upper index denotes inverse i.e c*^ = {cis)~^ ■ 

The cost c{x,x) will satisfy the standard conditions (Al) and (A2) but not 
(A3) (see for example [MTW] section 2.) We will require further that the second 
derivatives of the cost satisfy the following assumptions 

(c-al) ll(c"'--^)|| <eo < 1/20 

(c-a2) C{n) {\\D^c\\ + \\D'^c\\) < eq < 1/20 
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where C{n) is a dimensional constant, and the derivative norms are with respect 
to both barred and unbarred directions. Finahy we wih require that the densities 
are somewhat close to uniform 

(cm-a3) e-^(^)e^"(^)|detCi,| G [A-\A] 

for all x, x 6 r2 X J7 with 

(cm-a3b) ^^fe)"- 

We are now ready to state our result. 

Theorem 1.1. Let m, m be the mass densities defined by il.l]) il.2\) with /, / sat- 
isfying assumptions (al-3) on regions Vi, i7 whose defining functions also satisfy 
(al-3). There exists an eo(ri) such that if the cost function satisfies standard as- 
sumptions (Al) and (A2) and (c-al,a2) and (cm-aS) hold, then the optimal map 
transporting m to fh is regular. 

Remark 1.1. These conditions are nonvacuous. For example take f,h,f,h all to 

be 

2, ,2 1 

and 

c{x, x) — ~X ■ X. 

One can check that all the assumptions are satisfied with plenty of room to perturb 
any of the problems components. 

The following theorem will follow by a change of coordinates and rescaling. 

Theorem 1.2. Let xq^xq he two points in manifolds X,X such that near (a;o,a;o) 
the cost function is smooth and satisfies standard nondegeneracy conditions (Al)(A2). 
Then there exists a A large depending on the cost function, so that the optimal map 
from the Gaussian (after a choice of coordinates) 



-\^\x~xa\ /2 

to 



e ' "' ' Xb„^(xo) 



-\^\x-xo\ /2 

is smooth. 



e ' "' ' Xb„^(xo) 



Remark 1.2. We do not attempt to obtain any sharp results, rather the convenient 
smallness assumptions are to minimize crunchiness of the proof. Inspection of the 
proof will show that our choice of assumptions are robust. There is a rather large 
gap between what is covered here and the counterexamples, and we have no reason 
to suspect that these results are near sharp. 

Remark 1.3. We would like to obtain a similar result for complete Gaussians, as 
Caffarelli obtained in the Euclidean case in |C2] . In fact, it was an attempt to 
generalize the calculation in }C2j that led to this result. A limitation of our current 
method is that we cannot force (cm-a3) to hold on large regions. 
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1.1. Proof Heuristic. We will solve the problem by continuity, starting with Eu- 
clidean cost, obtaining second derivative estimates using the approach of Urbas [U] 
and Trudinger and Wang |TW| , making use of the Ma, Trudinger and Wang |MTW| 
calculation together with the calculation of Caffarelli |C2| . Making these methods 
work in the absense of the MTW condition, we use the following observation: The 
bound M on the second derivatives will satisfy the following type of inequality 

(1.7) (5A/2 - iM"+i - 1 < 0. 

When t is zero, this bounds M, so M is initially bounded. If t is small it follows 
that M{t) lies either on a relatively small compact interval containing [—1/2, 1/2] 
or on a noncompact interval. The bound M{t) is changing continuously with i, 
thus the interval it lies in must not change, thus from the initial bound we may 
conclude that for all t in some interval of fixed size, M{t) is bounded. 

The quadratic coefficient 5 in (|1.7|) (same 5 as in (a2)) arises when the target 
distribution is log-concave, as is the case with Gaussians. This fact is essential to 
the proof. 

2. Calculations 
Recall the symmetric tensor if (|1.6|) . We use the quantities defined as follows 
W{x) = y Wii ^ maxuJij ^ \Tj\ 
W{x) = y^ w" ^ I /mm Wii 

C3 > \\D^c\\C{n) 
Ci > \\D^c\\C{n) 

From (cm-a3) and Newton-McLaurin inequalities, it follows that 
(2.1) w,W>n-^ 



AiA 



(2.2) W < -^AW"-'^ 

71" ^ 

(2.3) W < ^-^Aiy""! 
and pluggin in (cm-a3b) 

(2.4) W,W>3/2. 

Notice that (al)(a2)(ca-l)(ca-2) imply the following inequality for any vector in R" 

(2.5) {hst-c''''cksthp)^s^t>^m^. 

Throughout this section wc will be assuming we have a smooth solution u to the 
equation (|1.4p on fl. Our goal is to prove second derivative estimates. 

We make use of the linearized operator at a solution u, from [TW| defined by 
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The following has an immediate consequence when maximums occur on the 
interior, and is also crucial in the boundary estimates in Section 4. The proof is 
a moderately long calculation and follows by the arguments in jMTWj . 

Lemma 2.1. Suppose u{x) is a solution to jl-jV - Then 

Lwii = 

= w'^ [2c,j,iTl + c,,stTlTl - 2ciusTJ - custTiT^] 
+ JstTlTl - /ii + e'{c,sii + 2c^stiTl + CrstpTlTl) 

+ (cr + crri)(c.si + c..prf) 

ij 
-W{W,jl. 

Applying the maximum principle, 

Corollary 2.2. If the largest eigenvalue W of w is attained on the interior, it must 
satisfy 

(2.6) —W^ ~ (C4 + C3 + C3IA/I) W^"+' - l^'/l - C(C3, C4) < 0. 

The next computation is implicit throughout |TWj sections 2,3 and 4. We state 
it for concreteness. 

Lemma 2.3. Let v{x) = F{x,T{x,Du)). Then 

(2.7) Lv = w'^F,, + 2F,sc" + F,tc''c>*w^, 

+ Fp {-cP^fk - cP''cks,jC^' - CkstcP''c''c>'w,,) 
- Fk {w^'c^.^c'" + fsc'" + c^'c^.tc'") . 

Corollary 2.4. Given conditions (c-al) (c-a2) and (al) (a2) on the functions f,f, 
h, and h, we have 

q - n 

Lh> —6W 

- 10 10 

Lh{T{x,Du))>^SW-^. 



2.1. Obliqueness. We follow the argument from [TW] section 2. Defining 

j^ Dh 

P = hsC'^dr 

we let 



6 MICAH WARREN 

From Lemma 12.31 with our assumptions we have 

Lx < W{\D^h\ + C3 + C4) + W{\D'^h\ + C3 + Ci) + C5(n). 
Then Coroharv 12.41 gives 

L{x-\h^\ho T{x))] < W{^^ - ^A) + Ty(^ - ^A) + 2^^X + C,{n) 



which is negative for A reasonably chosen. (Throughout we are using bounds (j2.ip 
etc, and our initial assumptions.) This function will then have a minimum at the 
boundary, precisely at the point where x achieves a minimum on the boundary, and 
at this point we have 

{Dx - \D{h oT)- \Dh] • -py < 

or 

(2.8) D{x-\hoT]=T^ 

for some r < A. 

Now computing (following |TW[ 2.31-2.33]), using (|2.5p and (|1.6p with our other 
assumptions including (jl.3p we conclude 



Dx-P^ c'% {huc'^K + hu (cf + cfTf) hs + hkc'^KpTf) 

= huP^'P' + C'%hkhscf + c'%hkT[{hspC''' - hsC'"'c^''Cmrp) 
(2.9) = hk^p'^P' + c'%hkhscf + hkhtTyc^'^ihrp - hsC^'^C.nrp) 

>\P\^6~C3>l6, 



The third term in p.9p can be expressed as an inner product g of the gradients of 
the functions h(x) and h o T{x), which are both multiples of the outward normal, 
where 

g{^,iy) = (hrp - hsC'"'c^rpV^C^'"S.kVa- 

Thus 

T-f- P^Dx- P~ \D{h oT)- P 
> (5/5 - XhsTtc'% 

= 6/5- Xw/3i3. 

Thus from r < A, 

(2.10) \x>S/5-Xw0p. 

Using symmetry (replacing all quantities with barred quantities we find the problem 
does not change, again see [TWj and Lemma 12. 6p , we may assume 

(2.11) Ax > S/5 - Xw^j. 
Then, using the Urbas formula [U], [TWl 2.13] 
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;ave 


or 










(2.12) 






X^ = "^77 


W/3/3 


we have combining 


(|2.1()|) (12, 


11^ 


and (|2.12|) 




(2.13) 






5 
^-lOA 


= 6*. 



Corollary 2.5. The following holds, regarding the angle between /? and j 

Z(/3,7)<A<7r/2. 

2.2. cost-convexity. 

Lemma 2.6. Suppose u{x) is a solution to {1-4^ on a domain in R". If D^u > 
2eo, and the cost function differs from the Euclidean cost function by less than then 
eq in C^, then u is c-convex, and the mapping T[x,u) is one to one. 

Proof. Suffice to consider the c = —x ■ y + (f>{x, y), where (p is small in C^(51). At 
a point xo, we have Du(xq) = Dc {xo,T{xo, Du)) = —T{xo, Du) + D(J3(xq,T(xq)). 
At another point, xi 

{Du{xi) — Du{xo), xi — xo) > 2eo|a:i — xqI^. 

Now suppose that u is not strictly c-convex. Clearly the issue would have to be 
nonlocal, as locally, 

D'^u - D'^c > 2eo - eo > 0. 
Thus we can assume that there is a point xq and a locally supporting cost lunction 

Cyo i^) = -X -TixQ) + (f>{x, T{xo)) 

which contacts u from below near xq but touches u (possibly transversely) at a 
point Xi. It follows that 

(DCy„{xi) - DCyg{xo),Xi - Xq) > {Du{xi) - Du{xo),Xi - Xq) 

that is 

||D^(/)||^i_i |a;i - xop > 2eo|a;i - xqP 
a contradiction. It follows that u is c-convex and T is one to one. D 



2.3. Boundary Estimate. Let 

M = max Wee 

be the maximum of all eigenvalues W over all of fl. Throughout this section we 
will assume that the maximum occurs on the boundary. 

Recalling (|2.3p and Lemma [^751 we may choose a Cg so that 

L{CeM"-^^"~^h - h{T{x,Du)) > 0. 

Since h, h both vanish on the boundary, the derivatives must satisfy 

Dph o T{x, Du) < C6M"-2/«-i 
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that is 

Lemma 2.7. At a point xq on the boundary dQ, suppose Wee < M for unit direc- 
tions e which are tangential to the boundary. If z is any vector in T^^^, then 

1 

w,z <M\z\- + 

where 



w,, <M\zf + —{z,\7hfwpp. 



z = z -P = z-y, 

and is defined by i2.13\) . 

Proof. Dotting with 7 verifies z is tangential, thus 

= dzh o T(x, Du) = hsTjZj = hsd^WijZj. 



Now 
but 



Wzz = Wii + 2Wzy + Wy 



Wzy = WijZjhsC^" = 



Wzz <M|z|^+ ( ^j w^^. 



D 



Now suppose that the maximum tangential derivative wn = AI^ happens at a 
point .To, where ei is a tangential direction. Define the function 

77 = wii - M^\eiix)\^ - C6-^(ei, V/i(a;)}2Af"-2/„-i ^ ^^^^^ ^ ^^^j^ + hoT) 
where 

"^^(")" - ''- aUT)c^^h,ix,T)f 
with ^ a smooth function satisfying ^(t) = t for t > 9/2, and ^(t) > 6'/4. Now 
computing, using Lemma |2. II and (|2.2p 

L77 > (5u;2 _ (^-4 + C3) W^VP^2 - C(n) - M |i|ei(x)p|) - Ce-^li (d, V/i(x))2|M"- 
+ C7(Af + 1)|^(5(M^ + P^) -2(1 + //)| 

and using (considering Lemma l2.3p 

|L|ei(T)p| <C8(M^ + 1 + W^) 
|LC6(ei,V/i(a;))^| < ^(M^ + 1 + W^). 
we may choose 

C7 = C8 + (C4 + C3)(A7 + M) 

so that 

Lri>0. 



LOG-CONCAVE 9 

Next we show a lower bound on Djswh^xq). First, observe that due to Lemma 
I2.7| rj has a maximum at xq- It follows from the Hopf maximum principle that 
Dri-P = vj-P > 0. Thus (recalling /ii(a;n) = 0) 

(2.14) 

- {Cg + (C4 + C3) (M + M) } M{Dph + DpH) 

> ~C{n)M^ ~ {Cs + (C4 + C3) {M + M)}Ce{n)il + M^^-s/n-iy 

Finally we will derive a relation between the maximum M of all eigenvalues 
of w and for tangential eigenvalues M^. Go to the point where the maximum 
of all eigenvalues for w happens. (Again, in this section we assume this happens 
along the boundary.) We diagonalize w = diag{M, A2, . . . A„) with respect to some 
coordinates ei^ . . . e„, choosing ei • 7 > 0. Now 

wpp = (/3 • e,fM + (/? ■ e2fX2 + ...{(3- e,,fK < C6(n)A/"-2/"-i 

thus 

(2.15) (/3-ei)2<C6(n)Af-i/"-i. 

It follows that there is a Cio depending on CQ{n) and A, (recall Corollarv l2.5p such 
that if M > Cio, then 

|Z(/3,ei)-V2|<i(7r/2-A) 

in particular 

^(7,ei)>i(7r/2-A). 

Thus the length of projection of the maximum eigenvector of w onto the tangent 
plane is at least some value aM depending on A. So we may assume that either 
M < Cio, or the maximum tangential value M'^ satisfies M'^ > aM. 

Proposition 2.8. Suppose that the global maximum for w is attained along the 
boundary. Then if M > Cio, M must satisfy 

(2.16) Af2 - (C4 + C3) A/"+^ < Cii 
Proof. Differentiating h oT{x, Du) twice tangcntially, 
(2.17) 

dnhoT{x,Du)) = h,T^,+hstT^Tl = -(V/i o T,//(l, 1)) 

= hp {cP'wn,k + cP^cn^sT^ " c^''cfci,,Tf - cP''cks,iT^ - CkstC^'^TlTl) 
(2.18) + hstc''w,ic''wji 

using |MTW1 4.11]. Now using hpd'''wi\^k = wwji, p.l4p and the discussion in 
the previous paragraph we conclude that if M > Cio, 

6aM^ ~ C{n)MT - {Cg + (C4 + C3) (Af + M) } C6(n)Af ^"-3/n-i _ (^^^2 

< CfiAf"^^/"^^ 
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Using Young's inequality to clean up the expression, we have 
(2.19) M2-(C4 + C3)M"+i <Cii. 

D 

3. Proof of Theorem 

We now go through the alternatives and make our choice of constants, in order 
to bound w and consequently D^u. 

First, if the maximum happens in the interior, then (|2.6p 

(3.1) Af2 - (C4 + C3) A/"+i < C12. 
If not, then either ([2TT9)) 

(3.2) M^ - (C4 + C3) A/"+i < Cii 
or 

(3.3) M < Cio, 

by the discussion surrounding (|2.15p . 

So we simply must choose (C4 + C3) small enough, say 

(C4 + C3) < £0 

so that the noncompact region defined by (|3.1|) docs not intersect the compact 
regions defined by (|3.2|) and p.3|) . similarly for the noncompact region defined by 
p.2[) . Further, in order to have c-convexity, we must assume that the conditions 
of Lemma 12.61 arc satisfied. The upper bounds in the above alternatives provide 
lower bounds on the Hessian, so we choose C3 small enough so that Lemma 12.61 is 
satisfied. 

Now by the theory of Delanoe [D], Caffarelli [CI] and Urbas [U] we have a 
classical solution to the problem for distance squared 

c°(a;,y) = |a;-y|V2 

in Euclidean space. 

We use the method of continuity. Openness is provided by Theorem 17.6 in GT, 
where we set 

G : C^'^'in) X [0, 1] ^ C°'" (O) X C^'" (dn) 

with 



G{u,t) 



In det 



-cf/{x,T^'\x,Du) -~h{x) + h{T^''>{x,Du))-\ndet cfj {x,T^*\x,Du)) 

h{T^*\x,Du)) 

where the cost function is changing from Euclidean to c as 

c(*) = (1 - i)cO + tc 

and r^*^ defined by 

Dc'-*\x,t'-*\x,Du)) ^Du. 
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Our initial solution uq is smooth , so it satisfies the above estimates p.!) etc) with 
C3C4 = 0. These bounds change continuously with t so D^u must stay in the 
compact components of (|3.ip (|3.2|) and p.3|) . As is standard for this problem, we 
cite [LT] to obtain the C^" estimates. By [GT] Theorem 17.6, we have openness 
in t, and the estimates give us closedness as long as |_D^c*-*^| , iD'^c^*^! < Eq- This 
completes the proof of Theorem 1.1. 

4. Theorem 2 
First we employ a change of coordinates so that 

Proof. Then, on a product of very small balls Bi/x{xq) x Bi/\{xq) we have 

for some C2 near 1, and |Z?'^c| , \D'^c\ < C which may be large but finite. 
We now rescale and consider the following problem on Bi{0) x Bi{0): Let 

be the cost function, and let the distributions to be transported be Gaussians, 
satisfying (al-3) on Bi{0),Bi{0). 

This cost function c'^-* now satisfies the conditions in our first theorem, as we see 
that choosing A large enough will make the third and fourth derivatives arbitrarily 
small. 

It follows by Theorem 1.1 that the solution to this rescaled optimal transporta- 
tion problem is smooth. However, the coordinate change and " change of currency" 
do not change the underlying optimal transportation problem. Thus we also have 
smoothness for the solution of the problem sending 

'"- '^ ABi/A(a;o) 

to 

'"- ^ ABi/a(xo)- 

This completes the proof. D 
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